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This  report  presents  the  rationale,  development,  and  standardization  of  the  Air  Force  Officer 
Qualifying  Test  (AFOQT)  Form  0.  The  AFOQT  is  used  to  select  Individuals  for  officer  commis¬ 
sioning  programs,  and  to  select  candidates  for  pilot  and  navigator  training.  AFOQT  Form  0 
contains  380  items  (many  from  previous  versions  of  the  AFOQT)  organized  In  16  subtests.  All 
Items  are  administered  In  a  single  test  booklet,  accompanied  by  a  single  machlne-scorable  answer 
sheet.  Five  composites,  expressed  as  percentiles,  are  derived  from  various  combinations  of  the 
subtests:  Pilot,  Navigator-Technical,  Academic  Aptitude,  Verbal,  and  Quantitative.  There  Is  a 
single  raw-score-to-percentlle  conversion  table  for  each  composite.  Although  Form  0  contains  the 
same  composites  as  predecessor  forms  of  the  AFOQT,  this  form  Introduced  changes  In  content, 
format,  administration,  and  scoring.  Reliabilities  and  Intercorrelatlons  are  consistent  with 
those  of  previous  forms. 

Form  0  was  standardized  to  link  scores  to  a  normative  group  based  on  Form  N  data.  The  use  of 
common  Items  on  Form  N  and  Form  0  enabled  a  series  of  equl percentile  equatlngs  through  the  coamnn 
Items,  resulting  In  the  equating  of  the  entire  Form  0  battery  to  the  entire  Form  N  battery.  Raw- 
score-to-percentlle-score  Form  0  composite  conversion  tables  were  developed  from  these  equatlngs, 
and  were  Implemented  for  operational  use  with  the  Introduction  of  Form  0  In  September  1981  at 
Officer  Training  School  testing  sites  and  In  March  1982  at  Air  Force  Reserve  Officer  Training 
Corps  testing  sites.  The  data  Indicate  the  test  Is  an  appropriate  replacement  for  Form  N  as  a 
selection  Instrument  for  Air  Force  officers. 


PREFACE 


This  work  was  accomplished  under  work  unit  number  77191847,  "Development  and 
Validation  of  Civilian  and  Non-Rated  Officer  Selection  Methodologies."  This  work  unit 
was  established  In  response  to  Air  Force  Regulation  (AFR)  35-8,  Air  Force  Military 
Testing  Program. 
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Dr.  Lonnie  Valentine,  Lt  Col  Lawrence  Short,  Dr.  Malcolm  Ree,  Ms.  Jacoblna  Skinner,  lLt 
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AIR  FORCE  OFFICER  QUALIFYING  TEST  FORM  0: 
DEVELOPMENT  AND  STANDARDIZATION 


I.  BACKGROUND 

The  Air  Force  Officer  Qualifying  Test  (AFOQT)  Is  used  to  select  Individuals  for  officer 
commissioning  programs,  such  as  Officer  Training  School  (OTS)  and  Air  Force  Reserve  Officer 
Training  Corps  (AFROTC).  It  is  also  used  to  select  candidates  for  specific  training  programs 
such  as  pilot  and  navigator  training.  New  forms  of  the  AFOQT  are  developed  on  a  periodic  basis, 
usually  a  3-year  cycle,  to  update  test  Information,  to  modify  test  content  as  needed,  and  to 
reduce  the  potential  for  test  compromise.  This  report  describes  the  development  and  standard¬ 
ization  of  AFOQT  Form  0,  which  replaced  Its  predecessor  (AFOQT  Form  N)  in  September  1981  at  OTS 
testing  sites  and  in  March  1982  at  AFROTC  testing  sites. 

The  precursor  of  the  AFOQT  was  the  Aircrew  Classification  Battery  (ACB),  a  test  developed  at 
the  onset  of  World  War  II  to  select  and  classify  individuals  into  aircrew  training.  In  1951, 
selected  subtests  of  the  ACB  were  combined  with  an  experimental  aptitude  test  called  the  Aviation 
Cadet— Officer  Candidate-Qualifying  Test.  The  resulting  combination  was  the  AFOQT  Form  A.  The 
co^rosition  and  use  of  the  AFOQT  have  changed  over  the  15  different  forms  of  the  test;  however, 
the  test  has  remained  the  basis  of  the  Air  Force  officer  selection  testing  program  to  the 
present.  Five  aptitude  composites  have  been  used  throughout  the  history  of  the  test:  Pilot, 
Navigator-Technical  (formerly  Observer-Technical),  Academic  Aptitude  (formerly  Officer  Quality), 
Verbal,  and  Quantitative.  Although  the  aptitudes  being  measured  by  the  composites  have  remained 
constant,  the  subtests  making  up  the  composites  have  changed  over  the  years.  These  changes  are 
documented  In  papers  describing  the  history  of  the  AFOQT  and  the  development  of  earlier  forms 
(Gould,  1978;  Miller,  1966,  1968,  1970,  1972,  1974;  Miller  t  Valentine,  1964;  Rogers,  Roach,  t 
Short,  1986;  Valentine  A  Creager,  1961). 

Prior  to  1960,  the  AFOQT  was  the  selection  test  used  for  all  commissioning  sources.  Since 
then.  Scholastic  Aptitude  Test  (SAT)  scores  have  been  the  primary  aptitude  selection  tool  for  the 
United  States  Air  Force  Academy  (USAFA).  The  AFOQT  is  still  used  to  select  candidates  (usually 
college  graduates  or  prior  enlistees)  for  OTS,  to  select  AFROTC  cadets  for  scholarships  or  Into 
the  Professional  Of"cers  Course,  and  to  select  candidates  for  Undergraduate  Pilot  Training  (UPT) 
and  Undergraduate  rigator  Training  (UNT). 

AFOQT  result'  re  reported  in  terms  of  composite  percentile  scores.  Conversion  tables  are 
used  to  convert  i  scores  on  the  composites  to  percentiles.  For  some  previous  forms,  different 
conversion  table  have  been  used  for  males  and  females  and  for  those  of  different  educational 
backgrounds  (se<  Rogers  et  al.,  1986).  With  the  implementation  of  Form  0  in  1981,  a  single 
conversion  table  was  developed  for  each  composite. 


II.  GENERAL  CHARACTERISTICS 

AFOQT  Form  0  consists  of  380  Items  organized  Into  16  subtests.  These  subtests  are  used  in 
one  or  more  of  five  aptitude  composites:  Pilot,  Navigator-Technical,  Academic  Aptitude,  Verbal, 
and  Quantitative.  Table  1  shows  the  subtests  and  how  they  are  organized  into  composites.  The 
test  requires  approximately  4.5  hours  to  administer.  Examinees  are  required  to  take  all  parts  of 
the  AFOQT.  One  conversion  table  Is  used  for  each  of  the  five  composites  to  convert  all 
applicants'  scores  to  percentiles. 


Table  1 .  Content  and  Organization  of  AFOQT  Fora  0 


Subtests 

No.  of 
Items 

P 

N-T 

Composites8 

AA  V 

0 

Verbal  Analogies  » 

25 

X 

X 

X 

Arithmetic  Reasoning 

25 

X 

X 

X 

Reading  Comprehension 

25 

X 

X 

Data  Interpretation 

25 

X 

X 

X 

Word  Knowledge 

25 

X 

X 

Math  Knowledge 

25 

X 

X 

X 

Mechanical  Comprehension 

20 

X 

X 

Electrical  Maze 

20 

X 

X 

Scale  Reading 

40 

X 

X 

Instrument  Comprehension 

20 

X 

Block  Counting 

20 

X 

X 

Table  Reading 

40 

X 

X 

Aviation  Information 

20 

X 

Rotated  Blocks 

15 

X 

General  Science 

20 

X 

Hidden  Figures 

15 

X 

Total 

380 

205 

265 

150 

75 

75 

aP  «  Pilot;  N-T  =  Navigator-Technical;  AA  =  Academic  Aptitude; 
V  ■  Verbal;  and  Q  =  Quantitative. 


Fora  0  Is  printed  as  a  single  test  booklet.  Air  Force  Personnel  Test  (AFP1)  982,  with  an 
accompanying  machlne-scannable  answer  sheet.  Two  sites  are  used  to  computer-score  answer 
sheets— the  Air  Force  Human  Resources  Laboratory  at  Brooks  AFB,  Texas  (to  score  answer  sheet  AFPT 
987),  and  Headquarters  AFROTC  at  Maxwell  AFB,  Alabama  (to  score  answer  sheet  AFROTC  PTF  987). 
Answer  sheets  are  scored  at  Maxwell  AFB  for  individuals  who  test  at  AFROTC  sites.  Most,  but  not 
all,  of  these  examinees  are  applying  for  a  comnlsslon  through  AFROTC.  Answer  sheets  are  scored 
at  Brooks  AFB  for  Individuals  tested  at  all  non-AFROTC  sites.  A  majority  of  these  examinees  are 
applying  for  a  comnlsslon  through  OTS.  The  administration  manual  Is  AFPT  980  for  non-AFROTC 
sites  and  AFROTC  PT  980  for  AFROTC  sites.  On  all  subtests,  only  correct  answers  are  scored.  All 
subtests  have  five  Item-options,  except  for  Instrument  Comprehension  which  has  four. 


III.  TEST  MODIFICATIONS  OVER  RECENT  FORMS 

Fora  0  differs  from  Its  most  recent  predecessor.  Fora  N,  in  content,  format,  administration, 
and  scoring.  Form  N  was  composed  of  606  Items  divided  Into  18  subtests.  For  Fora  0,  the  number 
of  Items  was  reduced  to  380,  and  the  number  of  subtests  was  reduced  to  16.  To  create  Fora  0, 
four  subtests  (Pilot  Biographic  and  Attitude  Scale,  Aerial  Landmarks,  Tools,  and  Background  for 
Current  Events)  that  were  Included  In  Form  N  were  removed;  one  subtest  (Aviation  Information) 
that  had  been  used  prior  to  Form  N  was  reinstated;  and  one  new  subtest  (Hidden  Figures)  was 
added.  The  16  subtests  In  Form  0  make  up  the  same  five  composites  derived  from  Fora  N;  however, 
the  Officer  Quality  composite  was  renamed  Academic  Aptitude.  This  was  done  to  prevent 
misinterpretation  of  what  the  composite  Is  Intended  to  measure.  Table  2  shows  the  subtests  In 
Fora  N  and  how  they  were  organized  Into  composites.  Six  of  the  subtests  In  Form  N  were 


designated  as  speeded  subtests.  For  Fom  0,  none  of  the  subtests  Is  specifically  designated  as 
speeded  since  all  subtests  contain  elements  of  both  power  and  speed.  Refer  to  Section  IV,  Item 
Selection,  for  more  Information  on  how  speededness  affected  Item  analysis. 


Table  2.  Content  and  Organization  of  AFOQT  Form  N 


Composites3 

Subtests 

Items 

P 

N-T 

OQ 

V 

Q 

Booklet  1 

Arithmetic  Reasoning 

25 

X 

X 

X 

Nath  Knowledge 

25 

X 

X 

X 

Data  Interpretation 

25 

X 

X 

X 

Booklet  2 

Word  Knowledge 

25 

X 

X 

Reading  Comprehension 

25 

X 

X 

Background  for  Current 

25 

X 

X 

Events 

Verbal  Analogies 

25 

X 

X 

X 

Booklet  3 

Table  Reading15 

50 

X 

X 

Electrical  Maze*5 

30 

X 

X 

Block  Counting15 

80 

X 

X 

Scale  Reading*5 

48 

X 

X 

Tools 

25 

X 

X 

Mechanical  Comprehension 

24 

X 

X 

Booklet  4 

Rotated  Blocks 

20 

X 

Aerial  Landmarks*5 

40 

X 

General  Science 

24 

X 

Instrument  Comprehension*5 

24 

X 

Pilot  Biographic  and 

66 

X 

Attitude  Scale 

Total 

606 

372 

416 

175 

100 

75 

aP  *  Pilot;  N-T  =  Navigator-Technical;  OQ  =  Officer  Quality; 


V  =  Verbal;  and  Q  *  Quantitative. 
^Speeded  Subtests. 


Form  N  subtests  were  printed  In  four  test  booklets,  accompanied  by  three  double-sided  answer 
sheets.  The  answer  sheets  were  hand-scored  using  10  scoring  keys.  Form  0  differs  In  format,  in 
that  all  16  subtests  are  contained  In  a  single  test  booklet,  which  Is  accompanied  by  a  single 
■achlne-scannable  answer  sheet.  The  use  of  a  single  answer  sheet  was  designed  to  reduce  the 
number  of  Incomplete  test  scores  due  to  missing  or  lost  answer  sheets  and  to  eliminate  answer 
sheet  matching  problems  resulting  from  transcription  errors  In  social  security  account  numbers. 

Test  adslnl  strati  on  differences  between  Form  0  and  Its  predecessors  resulted  directly  from 
the  reduction  In  the  length  of  the  test  battery.  The  use  of  a  less  complex  set  of  testing 
materials  shortened  the  total  time  for  test  administration  from  about  7  hours  to  about  4.5 
hours.  This  enabled  applicants  to  take  all  16  Form  0  subtests.  For  previous  forms,  time 


restrictions  made  It  necessary  to  require  only  applicants  who  were  applying  for  UPT  or  UNT  to 
take  the  subtests  unique  to  the  Pilot  and  Navigator-Technical  composites.  Unlike  previous  forms 
then,  all  composite  scores  are  available  for  applicants  testing  on  Form  0. 


Between  Form  N  and  Form  0,  scoring  procedures  changed  In  three  ways.  First,  because  of  the 
use  of  a  machlne-scannable  answer  sheet  for  Form  0,  computer-based  scoring  was  Initiated  as  a 
replacement  for  the  previous  Inefficient,  error-prone  hand-scoring  method  used  with  Form  N. 
Second,  speeded  subtests  In  Form  N  were  corrected  for  guessing.  The  Instrument  Comprehension 
subtest  was  scored  as  the  number  of  right  answers  minus  one-third  of  the  wrong  answers,  and  other 
speeded  subtests  were  scored  as  the  number  of  right  answers  minus  one-fourth  of  the  wrong 
answers.  All  subtests  In  Form  0  are  scored  as  number  right  only.  Finally,  Form  N  used  multiple 
tables  for  converting  each  composite's  raw  scores  Into  percentile  scores.  These  conversion 
tables  were  based  on  level  of  education,  to  take  Into  account  effects  of  education  on  AFOQT 
performance.  The  tables  were  established  for  two  categories  of  applicants:  those  with  less  than 
2  years  of  college  and  those  with  2  or  more  years  of  college.  A  decision  was  made  with  the 
Implementation  of  Form  0  to  put  all  scores  In  a  single  metric  (see  Roach,  1986).  This  reduced 
the  number  of  conversion  tables  to  five  (one  for  each  composite). 


IY.  ITEM  SELECTION 

Over  90*  of  the  Items  selected  for  Form  0  were  obtained  from  previous  versions  of  the  AFOQT. 
The  sources  of  the  Items  In  Form  0  are  provided  In  Table  3.  Form  N  Items  were  used  extensively 
In  order  to  link  Form  0  to  the  normative  group  (refer  to  Section  VI— Standardization). 


Table  3.  Source  of  Items  for  AFOQT  Form  0 


Items  selected  from  AFOQT  Form: 


AFOQT  Form  0 


L  Only  N  Only  LSM  MM 


LSMM  New  Total 


(Nonoverlapping) 

%  1 

Pilot 

Composite 

n 

%  1 

Navigator-Technical 

Composite 

n 

* 

Academic  Aptitude 
Composite 

n 

*  1 

Verbal 

Composite 

n 

*  ; 

Quantitative 

Composite 

n 

*  1 

8.0  12.0 


Each  form  of  the  AFOQT  Is  designed  to  have  a  level  of  difficulty  similar  to  that  of  the 
preceding  form.  Difficulty  levels  and  biserial  correlations  were  computed  for  Items  In  each  of 
the  subtests,  using  a  sample  of  37,409  applicants  (6,944  from  AFROTC  sites,  and  30,465  from 
non-AFROTC  sites)  who  were  tested  on  Form  0.  The  sample  contained  approximately  88*  males  and 
12*  females.  By  race,  79*  were  White,  13%  were  Black,  and  5%  were  Hispanic.  Applicants  from 
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AFROTC  testing  sites  have  typically  completed  12  or  13  years  of  school  and  are  18  or  19  years 
old.  Applicants  froa  non -AFROTC  testing  sites  are  usually  22-  to  25-year-old  college  graduates. 
The  difficulty  levels  were  computed  as  the  proportion  of  exaalnees  who  answered  the  Itea 
correctly.  Thus,  the  more  difficult  the  Itea,  the  lower  the  value.  In  a  very  few  cases,  Itea 
difficulties  are  below  chance.  There  are  not  enough  of  these  to  negatively  affect  subtest 
scores.  Blserlal  correlations  represent  the  relationship  between  the  correct  response  on  an  Itea 
and  the  total  score  on  the  subtest  In  which  the  Itea  Is  Included.  These  correlations  were  not 
corrected  for  1  tea-test  overlap. 

Due  to  testing  tlae  Halts  established  for  the  subtests,  a  large  mater  of  exaalnees  do  not 
coaplete  all  of  the  Iteas  on  a  majority  of  subtests.  This  aeans  that  aost  of  the  subtests  are, 
to  soae  extent,  speeded  In  nature.  The  degree  of  speededness  varies  froa  subtest  to  subtest  and 
ranges  froa  speed  of  perforaance  having  a  slight  lapact  to  a  heavy  lapact  on  subtest  score.  None 
of  the  subtests,  however,  can  be  considered  totally  speeded.  That  Is,  even  on  the  highly  speeded 
subtests,  power  plays  a  part  because  all  applicants  who  atteapt  each  Itea  do  not  necessarily 
answer  It  correctly.  For  this  reason,  none  of  the  Fora  0  subtests  has  been  specifically 
designated  as  speeded.  Although  the  same  speededness  considerations  have  similarly  applied  to 
previous  forms  of  the  AFOQT,  this  change  Is  being  made  to  better  represent  the  actual  situation. 

To  get  the  aaxlaum  amount  of  Information  about  each  subtest,  difficulty  levels  and  biserial 
correlations  were  computed  for  each  Item,  based  only  on  applicants  who  reached  that  Item.  The 
range  and  median  difficulty  levels  and  biserial  correlations  for  each  subtest  are  presented  In 
Table  4.  In  addition,  to  indicate  the  relative  speededness  of  each  subtest,  the  number  of  Items 
not  reached  by  SI  and  20*  of  the  applicants  Is  tabled.  From  these  data.  It  may  be  seen  that 
subtests  such  as  Table  Reading,  Electrical  Naze,  and  Scale  Reading  are  relatively  more  speeded, 
whereas  subtests  such  as  Mechanical  Comprehension  and  Rotated  Blocks  are  relatively  less  speeded. 

Table  4.  Item  Difficulty  Levels  and  Blserlal  Correlations  for  AFOQT  Form  0 


Subtest 

Dlfflcul' 

Range 

ty  level 
Median 

Blserlal 
correlation 
Range  Median 

No.  of  items 
not  reached  by 
t  applicants* 
5*  208 

Verbal  Analogies 

.29-.  91 

.65 

.42-. 75 

.58 

6 

2 

Arithmetic  Reasoning 

.24-.  81 

.57 

.38-. 68 

.61 

9 

4 

Reading  Comprehension 

.47-. 82 

.69 

.45-. 80 

.67 

10 

4 

Data  Interpretation 

.14-.  89 

.49 

.31  -.60 

.45 

12 

7 

Word  Knowledge 

.24-. 83 

.56 

.44-.  78 

.65 

10 

0 

Math  Knowledge 

.37-.  81 

.54 

.43-.  81 

.58 

13 

4 

Mechanical  Comprehension 

.31 -.83 

.47 

.24-. 66 

.53 

0 

0 

Electrical  Maze 

.18-.  67 

.41 

.37-. 76 

.59 

15 

10 

Scale  Reading 

.29-. 92 

.53 

.27-. 60 

.46 

19 

13 

Instrument  Comprehension 

.34-.  65 

.50 

.43-. 76 

.59 

13 

8 

Block  Counting 

.30-. 90 

.61 

.43-. 73 

.62 

12 

8 

Table  Reading 

.22-. 94 

.84 

.42-. 86 

.70 

23 

16 

Aviation  Information 

.23-. 72 

.47 

.38-. 72 

.56 

5 

0 

Rotated  Blocks 

.25-. 85 

.43 

.57-.  73 

.65 

0 

0 

General  Sciences 

.10-.  78 

.44 

.29-, 71 

.49 

3 

0 

Hidden  Figures 

.34-. 92 

.66 

.49-. 67 

.60 

9 

2 

aRefer  to  Table  1  for  the  number  of  Items  in  each  subtest. 
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V.  RELIABILITY,  INTERCORRELATIONS,  AND  VALIDITY 


Subtest  and  composite  reliabilities  and  Intercorrelatlons  were  coaputed  on  Fora  0  data,  using 
the  coablned  AFROTC  (n  -  6,944)  and  non-AFROTC  (n  -  30,465)  sample  of  37,409  cases.  As  would  be 
expected  of  two  similar  versions  of  the  same  test,  the  results  are  slitllar  to  the  data  obtained 
on  the  predecessor  test.  Form  N  (Gould,  1978).  Internal  consistency  reliability  coefficients 
were  derived  for  the  subtests  using  the  Kuder-RIchardson  Formula  20.  The  subtests  were  then 
combined  to  fora  the  five  composites,  and  the  reliabilities  of  the  coaposltes  were  based  on  a 
formula  developed  by  Hherry  and  Gaylord  (1943).  Tables  5  and  6  present  the  reliabilities  and 
standard  errors  of  measurement  of  Form  0  subtests  and  composites,  respectively.  Raw  scores  were 
used  In  all  computations.  It  should  be  noted  that  the  reliability  estimates  may  be  overestimated 
due  to  the  speededness  of  the  subtests.  Internal  consistency  Is  artificially  enhanced  by 
treating  Items  not  reached  by  applicants  as  Incorrect  (see  Table  4  for  an  account  of  how  many 
Items  are  affected  for  each  subtest).  Available  data  make  It  Impossible  to  estimate  lower-bound 
reliability  estimates. 

Table  S.  Reliabilities  and  Standard  Errors  of  Measurement  (SEM) 
for  AFOQT  Form  0  Subtests 


Subtest 

Reliability 

SEM 

Verbal  Analogies 

.796 

1.915 

Arithmetic  Reasoning 

.804 

1.944 

Reading  Comprehension 

.885 

2.031 

Data  Interpretation 

.719 

2.104 

Word  Knowledge 

.882 

2.013 

Math  Knowledge 

.867 

2.144 

Mechanical  Comprehension 

.712 

1.975 

Electrical  Maze 

.809 

1.822 

Scale  Reading 

.839 

2.700 

Instrument  Comprehension 

.844 

1.912 

Block  Counting 

.837 

1.793 

Table  Reading 

.925 

1.197 

Aviation  Information 

.794 

1.961 

Rotated  Blocks 

.769 

1.600 

General  Science 

.699 

1.992 

Hidden  Figures 

.701 

1.547 

Table  6.  Reliabilities  and  Standard  Errors  of  Measurement  (SIM) 
for  AFOQT  Form  0  Composites 


Composite 

Reliability 

SEM 

Pilot 

.964 

5.395 

Navigator-Technical 

.967 

6.657 

Academic  Aptitude 

.959 

4.963 

Verbal 

.944 

3.455 

Quantitative 

.919 

3.575 

Intercorrelatlon  matrices  based  on  Pearson  correlation  coefficients  were  computed  for 
subtest  raw  scores  (Table  7)  and  composite  raw  scores  (Table  8).  Although  subtest 
Intercorrelatlons  vary  considerably  (from  .169  to  .729),  composite  Intercorrelatlons  are 
consistently  high  (greater  than  .600).  High  composite  Intercorrelatlons  are  In  part  traceable  to 


the  overlap  of  subtests  In  composites,  especially  Verbal  and  Quantitative  with  Academic  Aptitude 
and  Pilot  with  Navigator-Technical  (see  Table  1). 


Validity  InfonMtlon  Is  available  for  recent  fones  of  the  AFOQT  In  studies  predicting  success 
In  non-rated  officer  technical  training  courses  (Arth,  1985),  air  weapons  controller  training 
(Flnegold  A  Rogers,  1985),  UPT  (Bordelon  A  Kantor,  1986),  and  UNT  (Shanahan  A  Kantor,  1986). 
Results  generally  show  AFOQT  scores  to  predict  performance  best  In  classroom  training. 
Validities  are  expected  to  remain  constant  across  forms  because  of  the  stability  of  the 
composites.  Validity  Information  obtained  on  AFROTC  cadets  who  took  AFOQT  Form  0  Is  available 
showing  the  relationship  between  AFOQT  composite  scores  and  scores  on  the  SAT,  American  College 
Test,  and  school  grade  point  average  (Diehl,  1986).  Results  are  encouraging,  suggesting  some 
unique  measurement  properties  of  the  AFOQT  (for  example,  the  Pilot  composite  correlates  less  than 
.45  with  any  non-AFOQT  measure),  but  showing  overlap  where  It  would  be  expected.  SAT-Verbal  and 
SAT-Math,  for  example,  have  strong  correlations  with  their  corresponding  AFOQT  composite 
counterparts  (Verbal  and  Quantitative),  and  much  lower  correlations  with  noncorresponding 
counterparts. 


Table  7.  Sub  test  Intercorrelatlons*  for  AFOQT  Form  0 


VA 

AR 

RC 

DI 

UK 

MK 

MC  EM  SR 

IC 

BC 

TR 

AI 

RB 

b 

VA 

AR  .566 

RC  .729 

.563 

DI  .536 

.672 

.557 

WK  .682 

.451 

.769 

.462 

MK  .534 

.711 

.505 

.603 

.404 

MC  .476 

.497 

.465 

.466 

.388 

.477 

EM  .265 

.362 

.241 

.376 

.169 

.377 

.443 

SR  .481 

.681 

.462 

.636 

.361 

.624 

.497  .443 

IC  .368 

.406 

.346 

.448 

.284 

.379 

.502  .422  .502 

BC  .425 

.509 

.388 

.502 

.297 

.486 

.486  .  467  .  612 

.498 

TR  .344 

.450 

.355 

.466 

.261 

.453 

.313  .321  .556 

.372 

.519 

AI  .340 

.316 

.365 

.359 

.331 

.264 

.508  .283  .363 

.581 

.316 

.242 

RB  .404 

.453 

.329 

.408 

.257 

.459 

.544  .412  .499 

.466 

.542 

.347 

.350 

GS  .510 

.473 

.536 

.437 

.507 

.525 

.570  .336  .424 

.420 

.365 

.263 

.465 

.409 

HF  .363 

.368 

.319 

.372 

.262 

.370 

.383  .337  .443 

.358 

.450 

.363 

.255 

.428 

*A11  Intercorrelatlons  are  statistically  significant  at  2.  <  .05. 

bVA  -  Verbal  Analogies;  AR  ■  Arithmetic  Reasoning;  RC  *  Reading  Comprehension;  DI  -  Data 
Interpretation;  NX  -  Word  Knowledge;  MK  -  Math  Knowledge;  MC  ■  Mechanical  Comprehension; 
EM  ■  Electrical  Maze;  SR  -  Scale  Reading;  IC  •  Instrument  Comprehension;  6C  *  Block  Counting; 
TR  -  Table  Reading;  AI  «  Aviation  Information;  RB  -  Rotated  Blocks;  GS  -  General  Science;  and 
HF  ■  Hidden  Figures. 


Table  8.  Composite  Intercorrelatlons*  for  AFOOT  Fora  0 


Coaposlte 

P 

N-T 

AA 

V 

Pilot  (P) 

Nav-Tech  (N-T) 

.945 

Acadealc  Aptitude  (AA) 

.753 

.821 

Verbal  (V) 

.615 

.615 

.917 

Quantitative  (Q) 

.754 

.887 

.886 

.627 

aA11  Intercorrelatlons  are  statistically  significant  atj>.<  >05. 


VI.  STANDARDIZATION 

The  normative  group  for  AFOQT  Fora  0  scores  Is  a  saaple  of  basic  alraen,  AFROTC  students,  OTS 
candidates,  USAFA  cadets,  and  junior  officers,  all  of  whoa  were  tested  on  AFOQT  Fona  N  (see 
Gould,  1978,  for  aore  Inforaatlon  about  the  saaple).  Fora  0  scores  were  linked  to  Fora  N  scores 
and  to  the  noraatlve  group  using  equl percentile  equating  of  foras  with  coaaon  Iteas  (Angoff, 
1971).  This  was  accoapllshed  using  the  274  Iteas  which  overlap  between  Foras  N  and  0.  This  Is 
the  first  fora  of  the  AFOQT  to  be  equated  to  an  anchor  test  (Fora  N)  through  the  use  of  coaaon 
Iteas. 

A  three-step  process  was  used  to  equate  Fora  0  to  Fora  N  (see  Figure  1).  In  the  first  step. 
Fora  N  Itea  responses  were  obtained  for  a  saaple  of  applicants  to  the  Alraen  Education  and 
Coaailsslonlng  Prograa  (AECP,  n  -  7,047),  OTS  (n  *  4,581),  and  AFROTC  (n  *  2,742).  Of  the 
original  saaple,  about  6,000  coapleted  all  of  the  subtests;  the  reaalnder  coapleted  only  the 
subtests  required  for  the  Officer  Quality,  Verbal,  and  Quantitative  coaposltes.  These  data  were 
weighted  to  represent  the  applicant  population  In  the  late  1970s  by  type  of  application  prograa 
( approx laately  ltt  AECP,  50K  OTS,  and  408  AFROTC).  Coaposlte  scores  for  this  saaple  were 
coaputed  based  on  all  606  Iteas  In  Fora  N  (referred  to  as  Full  Battery,  or  FB)  and  on  the  274 
Fora  N  Iteas  that  were  used  In  Fora  0  (referred  to  as  Short  Battery,  or  SB).  Using  this  saaple, 
the  SB  Fora  N  scores  were  equated  to  FB  Fora  N  scores,  and  SB  Fora  N  raw-score-to-percentlle 
conversion  tables  were  developed  for  each  coaposlte.  These  tables  therefore  provided  conversions 
froa  SB  Fora  N  raw  coaposlte  scores  to  percentiles,  based  on  the  original  Fora  N  noraatlve  group. 


Noraatlve 

FB  Fora  N 

FB  Fora  0 

Group 

606  -  Equated 

-  SB  Fora  H  - 

SB  Fora  0  - 

Equated  - 

368 

Iteas 

274 

274 

Iteas 

Items 

Iteas 

Anchor  Test 

Coaaon  Iteas 

Common  Iteas 

New  Test 

1 

_ 1  l _ 

_ 1  1 _ 

_ | 

Step  1 

Step  2 

Step  3 

Figure  1.  Equating  through  Coaaon  Iteas  to  Noraatlve  Group 


Because  the  SB  Fora  N  Items  were  contained  In  Form  0,  the  second  step  Involved  adopting  the 
SB  Fora  N  conversion  tables  for  Fora  0.  Ready-aade  conversion  tables  tied  to  the  Fora  N 
noraatlve  group  were  thus  available,  beginning  with  the  laplementatlon  of  Fora  0  In  1981,  by 
scoring  the  274  Iteas  In  Fora  0  that  were  coaaon  to  Fora  N  and  using  the  SB  Fora  N  composite 
conversion  tables.  These  tables,  which  can  also  be  labeled  SB  Fora  0  conversion  tables,  are 
presented  In  Appendix  A. 

The  third  step  In  equating  Fora  0  to  Fora  N  used  equlpercentlle  equating  of  FB  Form  0 
scores1  to  SB  Fora  0  scores  to  link  these  scores  to  the  noraatlve  group.  The  saaple  used  for 
this  equating  consisted  of  Air  Force  officer  applicants  who  tested  on  Fora  0  at  AFROTC  (n  * 
6,944)  and  non-AFROTC  (n  -  30,465)  test  sites.  This  Is  the  same  sample  that  was  used  for  the 
Item  analyses  (Section  IV)  and  the  reliabilities  and  Intercorrelatlons  (Section  V).  Item 
responses  were  used  to  calculate  FB  and  SB  Fora  0  scores  for  each  applicant.  Heights  were 
applied  prior  to  the  equating  so  that  applicants  tested  at  AFROTC  and  non-AFROTC  sites  would  be 
equally  represented.  The  resulting  FB  Fora  0  raw-score-to-percentlle  composite  conversion  tables 
are  presented  In  Appendix  6.  These  tables  were  Implemented  operationally  on  24  January  1984. 
Prior  to  that  date,  reported  AFOQT  percentile  scores  for  applicants  who  took  Fora  0  were  based  on 
the  SB  Fora  0  composite  conversion  tables. 


VII.  CONCLUSIONS 

A  new  fora  of  the  AFOQT  (Fora  0)  has  been  developed  and  standardized  through  the  use  of 
coaaon  Items  between  Forms  N  and  0.  Tables  are  presented  to  convert  Fora  0  raw  composite  scores 
to  percentiles  referenced  to  a  Fora  N  noraatlve  group.  The  results  from  Item  analyses  and  the 
Intercorrelatlons  and  Internal  consistencies  of  the  subtests  and  composites  Indicate  the  test  Is 
an  appropriate  replacement  for  Form  N  as  a  selection  Instrument  for  Air  Force  officers. 
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APPENDIX  Ai  AFOQT  FORM  0  SHORT  BATTERY  CONVERSION  TABLES 


Table  A-l. 


Fora  0  Pilot  Composite  Short  Battery  Conversion  Table 


Raw 

score 

Percentile 

Raw 

score 

Percentile 

Raw 

score  Percentile 

0-33 

01 

75 

30 

103 

69 

34-38 

02 

76 

31 

104 

70 

39-41 

03 

77 

33 

105 

71 

42-44 

04 

78 

34 

106 

73 

45-46 

05 

79 

35 

107 

74 

47-48 

06 

80 

37 

108 

75 

49-50 

07 

81 

38 

109 

77 

51 

08 

82 

39 

110 

78 

52-53 

09 

83 

41 

111 

79 

54 

10 

84 

42 

112 

81 

55-56 

11 

85 

43 

113 

82 

57 

12 

86 

45 

114 

83 

58-59 

13 

87 

46 

115 

84 

60 

14 

88 

47 

116 

85 

61 

15 

89 

49 

117 

86 

62 

16 

90 

50 

118 

87 

63 

17 

91 

51 

119 

88 

64 

18 

92 

52 

120 

89 

65 

19 

93 

54 

121 

90 

66 

20 

94 

55 

122 

91 

67 

21 

95 

57 

123 

92 

68 

22 

96 

58 

124 

93 

69 

23 

97 

60 

125-126 

94 

70 

24 

98 

62 

127-128 

95 

71 

25 

99 

63 

129-130 

96 

72 

26 

100 

64 

131-133 

97 

73 

27 

101 

66 

134-135 

96 

74 

29 

102 

67 

136-158 

99 

Table  A-5.  Form  0  Quantitative  Composite  Short  Batteiy  Conversion  Table 


Raw 

score 


Nrcentlle 


Raw 

score 


Percentile 


Table  B-2.  Fora  0  Navigator-Technical  Composite  Full  Battery  Conversion  Table 


|  Raw 

Raw 

Raw 

score 

Percentile 

score 

Percentile 

score 

Percentile 

r  \ 

92-93 

94-95 

96-97 

98-99 


100-101 

102-103 

104-105 

106-107 


27 

28 

29 

30 
131-132 

33 


109-110 

20 

111-112 

21 

113 

22 

114 

23 

115 

24 

116-117 

25 

118 

26 

119 

27 

120 

28 

121 

29 

122-123 

30 

124 

31 

125 

32 

16 

126 

33 

176-177 


179-180 

181 

182-183 


187-188 

789-190 

191-192 

193-194 

195-196 


198-199 

200-202 

203-205 

206-209 

210-214 

215-218 

219-257 


£ 


r 


Table  B-4.  Fora  0  Verbal  Composite  Full  Battery  Conversion  Table 


I 


Raw 

score 


Percentile 


Raw 

score 


Percentile 


! 


« 

Table  B-5.  Fora  0  Quantitative  Composite  Full  Battery  Conversion  Table 


Raw 

score 


Percentile 


Raw 

score 


Percentile 


